Exploiting the kernel trick to correlate fragment ions for peptide identification via tandem mass spectrometry

Authors

  • Yan Fu
  • Qiang Yang
  • Ruixiang Sun
  • Dequan Li
  • Rong Zeng
  • Charles X. Ling
  • Wen Gao
Abstract

MOTIVATION: The correlation among fragment ions in a tandem mass spectrum is crucial for reducing stochastic mismatches in peptide identification by database searching. Until now, an efficient scoring algorithm that considers this correlative information in a tunable and comprehensive manner has been lacking.

RESULTS: This paper presents a promising approach to using the correlative information to improve peptide identification accuracy. The kernel trick, rooted in statistical learning theory, is exploited to address this issue with low computational effort. The common scoring method, the tandem mass spectral dot product (SDP), is extended to the kernel SDP (KSDP). Experiments on a previously reported dataset demonstrate the effectiveness of the KSDP. The implementation on consecutive fragments shows a decrease of 10% in the error rate compared with the SDP. Our software tool, pFind, which uses a simple scoring function based on the KSDP, outperforms two SDP-based software tools, SEQUEST and Sonar MS/MS, in identification accuracy.

SUPPLEMENTARY INFORMATION: http://www.jdl.ac.cn/user/yfu/pfind/index.html
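
The abstract does not give the KSDP formula, so the following is only a minimal sketch of the general idea, not the authors' scoring function. It assumes binned intensity vectors aligned to one fragment-ion series, and it swaps in a hypothetical sliding-window polynomial kernel (the names `windowed_kernel_sdp`, `window` and `degree` are illustrative) to show how a nonlinear kernel can reward runs of consecutive matched fragments that the plain SDP cannot distinguish from scattered matches.

```python
def sdp(observed, theoretical):
    """Plain spectral dot product of two equal-length intensity vectors."""
    if len(observed) != len(theoretical):
        raise ValueError("vectors must have the same length")
    return sum(o * t for o, t in zip(observed, theoretical))


def windowed_kernel_sdp(observed, theoretical, window=2, degree=2):
    """Illustrative kernelized score (an assumption, not the paper's formula).

    Each sliding window of `window` neighbouring fragments contributes
    (local dot product) ** degree, so matches that sit next to each other
    reinforce one another when degree > 1.
    """
    if len(observed) != len(theoretical):
        raise ValueError("vectors must have the same length")
    if window < 1 or degree < 1:
        raise ValueError("window and degree must be at least 1")
    products = [o * t for o, t in zip(observed, theoretical)]
    total = 0.0
    for start in range(len(products) - window + 1):
        local = sum(products[start:start + window])
        total += local ** degree
    return total


# Hypothetical toy data: six predicted fragments, three of them matched.
theoretical = [1, 1, 1, 1, 1, 1]
consecutive = [1, 1, 1, 0, 0, 0]  # matches form one unbroken run
scattered = [1, 0, 1, 0, 1, 0]    # same number of matches, no run

print(sdp(consecutive, theoretical), sdp(scattered, theoretical))
print(windowed_kernel_sdp(consecutive, theoretical),
      windowed_kernel_sdp(scattered, theoretical))
```

In this toy case both spectra receive the same SDP, while the windowed score is higher for the consecutive run. With `window=1` and `degree=1` the sketch reduces to the plain SDP, which is the sense in which the kernel version is a tunable extension.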


Similar resources

A Nonlinear Scoring Framework for Peptide Identification via Tandem Mass Spectrometry

The problem of false positives in peptide identification via tandem mass spectrometry (MS/MS) by database searching remains unsatisfactorily resolved in the current proteomics research. The correlative information among fragment ions in the MS/MS spectrum can be very helpful for reducing the number of false positives. However, due to the computational difficulty, existing peptide-scoring algori...


FAST ATOM BOMBARDMENT MASS SPECTROMETRY (FABMS) ANALYSIS OF AN N-TERMINAL-BLOCKED PEPTIDE

FABMS analysis of the T-1b peptide before and after one cycle of Edman degradation indicated an unblocked N-terminal Thr residue for this tryptic peptide. In contrast, our data showed a protonated molecular ion, MH+, for the T-1a peptide at 655 mass units (mu), which is 42 mu higher than the MH+ ion of the T-1b peptide. In addition, the T-1a peptide was not amenable to one cycle of manual Edman degrada...


Context-sensitive markov models for peptide scoring and identification from tandem mass spectrometry.

Peptide and protein identification via tandem mass spectrometry (MS/MS) lies at the heart of proteomic characterization of biological samples. Several algorithms are able to search, score, and assign peptides to large MS/MS datasets. Most popular methods, however, underutilize the intensity information available in the tandem mass spectrum due to the complex nature of the peptide fragmentation ...


Statistical characterization of ion trap tandem mass spectra from doubly charged tryptic peptides.

Collision-induced dissociation (CID) is a common ion activation technique used to energize mass-selected peptide ions during tandem mass spectrometry. Characteristic fragment ions form from the cleavage of amide bonds within a peptide undergoing CID, allowing the inference of its amino acid sequence. The statistical characterization of these fragment ions is essential for improving peptide iden...


SILVER helps assign peptides to tandem mass spectra using intensity-based scoring.

Tandem mass spectrometry is commonly used to identify peptides (and thereby proteins) that are present in complex mixtures. Peptide identification from tandem mass spectra is partially automated, but still requires human curation to resolve "borderline" peptide-spectrum matches (PSMs). SILVER is web-based software that assists manual curation of tandem mass spectra, using a recently developed i...



Journal:
  • Bioinformatics

Volume 20, Issue 12

Publication date: 2004